AITopics | nonlinear activation function

Collaborating Authors

nonlinear activation function

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Demystifying Oversmoothing in Attention-Based Graph Neural Networks

Neural Information Processing SystemsApr-28-2026, 13:11:06 GMT

Oversmoothing in Graph Neural Networks (GNNs) refers to the phenomenon where increasing network depth leads to homogeneous node representations. While previous work has established that Graph Convolutional Networks (GCNs) exponentially lose expressive power, it remains controversial whether the graph attention mechanism can mitigate oversmoothing. In this work, we provide a definitive answer to this question through a rigorous mathematical analysis, by viewing attention-based GNNs as nonlinear time-varying dynamical systems and incorporating tools and techniques from the theory of products of inhomogeneous matrices and the joint spectral radius. We establish that, contrary to popular belief, the graph attention mechanism cannot prevent oversmoothing and loses expressive power exponentially. The proposed framework extends the existing results on oversmoothing for symmetric GCNs to a significantly broader class of GNN models, including random walk GCNs, Graph Attention Networks (GATs) and (graph) transformers.

artificial intelligence, machine learning, matrix, (17 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

c7201deff8d507a8fe2e86d34094e154-Paper-Conference.pdf

Neural Information Processing SystemsFeb-18-2026, 02:22:26 GMT

artificial intelligence, machine learning, operator, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(2 more...)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)
Information Technology > Communications (0.68)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.48)

Add feedback

Transformer as a hippocampal memory consolidation model based on NMDAR-inspired nonlinearity Dong-Kyum Kim

Neural Information Processing SystemsFeb-9-2026, 16:06:34 GMT

The hippocampus plays a critical role in learning, memory, and spatial representation, processes that depend on the NMDA receptor (NMDAR).

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: Asia > Japan > Kyūshū & Okinawa > Okinawa (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Workflow (0.67)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

where ℓ = 1,2,,L is the number of hidden layers (ψ(1)(ri) = ψ(ri) and L is the final layer), ReLU is the nonlinear activation function, W (ℓ) E RN N is the weight matrix in layer ℓ,and b

Neural Information Processing SystemsFeb-7-2026, 14:13:37 GMT

These molecular properties were calculated using a hybrid quantum simulation (Gaussian 09) at the B3LYP/6-31G(2df,p) level of theory. In this study, we created a subset of the QM9 dataset with a limited number of atoms, M 14, per molecule, which we refer to as the "QM9under14atoms" dataset in the main text. As the learning/predicting targets, we selected three kinds of energy properties: atomization energy at 0 K, zero point vibrational energy, and enthalpy at 298.15 K. E RN is the bias vector in layer ℓ. The LCAO considers the normalization for the coefficients in Eq. (6) in the main text. Additionally, the normalization term in Eq. (7) in the main text is calculated as follows: Z(qn,ζn)=

artificial intelligence, machine learning, nonlinear activation function, (10 more...)

Neural Information Processing Systems

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.07)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.05)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Spatially Parallel All-optical Neural Networks

Qin, Jianwei, Liu, Yanbing, Liu, Yan, Liu, Xun, Li, Wei, Ye, Fangwei

arXiv.org Artificial IntelligenceDec-1-2025

All-optical neural networks (AONNs) have emerged as a promising paradigm for ultrafast and energy-efficient computation. These networks typically consist of multiple serially connected layers between input and output layers--a configuration we term spatially series AONNs, with deep neural networks (DNNs) being the most prominent examples. However, such series architectures suffer from progressive signal degradation during information propagation and critically require additional nonlinearity designs to model complex relationships effectively. Here we propose a spatially parallel architecture for all-optical neural networks (SP-AONNs). Unlike series architecture that sequentially processes information through consecutively connected optical layers, SP-AONNs divide the input signal into identical copies fed simultaneously into separate optical layers. Through coherent interference between these parallel linear sub-networks, SP-AONNs inherently enable nonlinear computation without relying on active nonlinear components or iterative updates. We implemented a modular 4F optical system for SP-AONNs and evaluated its performance across multiple image classification benchmarks. Experimental results demonstrate that increasing the number of parallel sub-networks consistently enhances accuracy, improves noise robustness, and expands model expressivity. Our findings highlight spatial parallelism as a practical and scalable strategy for advancing the capabilities of optical neural computing.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2509.23611

Country: Asia > China (0.48)

Genre: Research Report > New Finding (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

The motion planning neural circuit in goal-directed navigation as Lie group operator search

Neural Information Processing SystemsOct-10-2025, 16:22:29 GMT

The present study investigates how brain's neural circuits search group operators

feedforward circuit, operator, representation, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
Asia > India (0.04)
Asia > China (0.04)

Genre: Research Report > Experimental Study (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Robots > Robot Planning & Action (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

2f1eb4c897e63870eee9a0a0f7a10332-Paper-Conference.pdf

Neural Information Processing SystemsOct-8-2025, 09:34:39 GMT

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country: Asia > Japan > Kyūshū & Okinawa > Okinawa (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Workflow (0.67)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

Graph Neural Network-Based Distributed Optimal Control for Linear Networked Systems: An Online Distributed Training Approach

Song, Zihao, Welikala, Shirantha, Antsaklis, Panos J., Lin, Hai

arXiv.org Artificial IntelligenceJul-23-2025

In this paper, we consider the distributed optimal control problem for discrete-time linear networked systems. In particular, we are interested in learning distributed optimal controllers using graph recurrent neural networks (GRNNs). Most of the existing approaches result in centralized optimal controllers with offline training processes. However, as the increasing demand of network resilience, the optimal controllers are further expected to be distributed, and are desirable to be trained in an online distributed fashion, which are also the main contributions of our work. To solve this problem, we first propose a GRNN-based distributed optimal control method, and we cast the problem as a self-supervised learning problem. Then, the distributed online training is achieved via distributed gradient computation, and inspired by the (consensus-based) distributed optimization idea, a distributed online training optimizer is designed. Furthermore, the local closed-loop stability of the linear networked system under our proposed GRNN-based controller is provided by assuming that the nonlinear activation function of the GRNN-based controller is both local sector-bounded and slope-restricted. The effectiveness of our proposed method is illustrated by numerical simulations using a specifically developed simulator.

artificial intelligence, controller, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2504.06439

Country: North America > United States (0.46)

Genre:

Research Report (0.64)
Instructional Material > Online (0.41)

Industry:

Energy (1.00)
Education > Educational Setting > Online (0.95)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Generative System Dynamics in Recurrent Neural Networks

Casoni, Michele, Guidi, Tommaso, Betti, Alessandro, Melacci, Stefano, Gori, Marco

arXiv.org Artificial IntelligenceApr-22-2025

In this study, we investigate the continuous time dynamics of Recurrent Neural Networks (RNNs), focusing on systems with nonlinear activation functions. The objective of this work is to identify conditions under which RNNs exhibit perpetual oscillatory behavior, without converging to static fixed points. We establish that skew-symmetric weight matrices are fundamental to enable stable limit cycles in both linear and nonlinear configurations. We further demonstrate that hyperbolic tangent-like activation functions (odd, bounded, and continuous) preserve these oscillatory dynamics by ensuring motion invariants in state space. Numerical simulations showcase how nonlinear activation functions not only maintain limit cycles, but also enhance the numerical stability of the system integration process, mitigating those instabilities that are commonly associated with the forward Euler method. The experimental results of this analysis highlight practical considerations for designing neural architectures capable of capturing complex temporal dependencies, i.e., strategies for enhancing memorization skills in recurrent models.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2504.13951

Country: Europe > Italy (0.14)

Genre: Research Report > New Finding (0.66)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

Add feedback

Filters

Collaborating Authors

nonlinear activation function

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Demystifying Oversmoothing in Attention-Based Graph Neural Networks

c7201deff8d507a8fe2e86d34094e154-Paper-Conference.pdf

6e4cdfdd909ea4e34bfc85a12774cba0-Paper-Conference.pdf

Transformer as a hippocampal memory consolidation model based on NMDAR-inspired nonlinearity Dong-Kyum Kim

where ℓ = 1,2,,L is the number of hidden layers (ψ(1)(ri) = ψ(ri) and L is the final layer), ReLU is the nonlinear activation function, W (ℓ) E RN N is the weight matrix in layer ℓ,and b

Spatially Parallel All-optical Neural Networks

The motion planning neural circuit in goal-directed navigation as Lie group operator search

2f1eb4c897e63870eee9a0a0f7a10332-Paper-Conference.pdf

Graph Neural Network-Based Distributed Optimal Control for Linear Networked Systems: An Online Distributed Training Approach

Generative System Dynamics in Recurrent Neural Networks